Search CORE

55 research outputs found

An Ensemble-based Approach to Click-Through Rate Prediction for Promoted Listings at Etsy

Author: Aryafar Kamelia
Guillory Devin
Hong Liangjie
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 21/11/2017
Field of study

Etsy is a global marketplace where people across the world connect to make, buy and sell unique goods. Sellers at Etsy can promote their product listings via advertising campaigns similar to traditional sponsored search ads. Click-Through Rate (CTR) prediction is an integral part of online search advertising systems where it is utilized as an input to auctions which determine the final ranking of promoted listings to a particular user for each query. In this paper, we provide a holistic view of Etsy's promoted listings' CTR prediction system and propose an ensemble learning approach which is based on historical or behavioral signals for older listings as well as content-based features for new listings. We obtain representations from texts and images by utilizing state-of-the-art deep learning techniques and employ multimodal learning to combine these different signals. We compare the system to non-trivial baselines on a large-scale real world dataset from Etsy, demonstrating the effectiveness of the model and strong correlations between offline experiments and online performance. The paper is also the first technical overview to this kind of product in e-commerce context

arXiv.org e-Print Archive

Crossref

Remote Work Optimization with Robust Multi-channel Graph Neural Networks

Author: Guo Qi
Hong Liangjie
Wu Liang
Zhu Qinyi
Publication venue
Publication date: 26/08/2022
Field of study

The spread of COVID-19 leads to the global shutdown of many corporate offices, and encourages companies to open more opportunities that allow employees to work from a remote location. As the workplace type expands from onsite offices to remote areas, an emerging challenge for an online hiring marketplace is how these remote opportunities and user intentions to work remotely can be modeled and matched without prior information. Despite the unprecedented amount of remote jobs posted amid COVID-19, there is no existing approach that can be directly applied. Introducing a brand new workplace type naturally leads to the cold-start problem, which is particularly more common for less active job seekers. It is challenging, if not impossible, to onboard a new workplace type for any predictive model if existing information sources can provide little information related to a new category of jobs, including data from resumes and job descriptions. Hence, in this work, we aim to propose a principled approach that jointly models the remoteness of job seekers and job opportunities with limited information, which also suffices the needs of web-scale applications. Existing research on the emerging type of remote workplace mainly focuses on qualitative studies, and classic predictive modeling approaches are inapplicable considering the problem of cold-start and information scarcity. We precisely try to close this gap with a novel graph neural architecture. Extensive experiments on large-scale data from real-world applications have been conducted to validate the superiority of the proposed approach over competitive baselines. The improvement may translate to more rapid onboarding of the new workplace type that can benefit job seekers who are interested in working remotely

arXiv.org e-Print Archive

A Privacy-Preserving Hybrid Federated Learning Framework for Financial Crime Detection

Author: Dong Fan
Drew Steve
Hong Junyuan
Xue Liangjie
Zhang Haobo
Zhou Jiayu
Publication venue
Publication date: 18/04/2023
Field of study

The recent decade witnessed a surge of increase in financial crimes across the public and private sectors, with an average cost of scams of $102m to financial institutions in 2022. Developing a mechanism for battling financial crimes is an impending task that requires in-depth collaboration from multiple institutions, and yet such collaboration imposed significant technical challenges due to the privacy and security requirements of distributed financial data. For example, consider the modern payment network systems, which can generate millions of transactions per day across a large number of global institutions. Training a detection model of fraudulent transactions requires not only secured transactions but also the private account activities of those involved in each transaction from corresponding bank systems. The distributed nature of both samples and features prevents most existing learning systems from being directly adopted to handle the data mining task. In this paper, we collectively address these challenges by proposing a hybrid federated learning system that offers secure and privacy-aware learning and inference for financial crime detection. We conduct extensive empirical studies to evaluate the proposed framework's detection performance and privacy-protection capability, evaluating its robustness against common malicious attacks of collaborative learning. We release our source code at https://github.com/illidanlab/HyFL .Comment: PETs prize challenge versio

arXiv.org e-Print Archive

Redefining cardiac biomarkers in predicting mortality and adverse outcomes of inpatients with COVID-19

Author: Akolkar Gauri
Bai Liangjie
Blet Alice
Cai Jingjing
Chen Guohua
Chen Jing
Chen Manhua
Chen Ming-Ming
Chen Ze
Cheng Xu
Gao Chen
Huang Xiaodong
Huang Zizhen
Lei Fang
Li Hongliang
Lin Lijin
Liu Liming
Liu Peter P.
Liu Ye-Mao
Luo Pengcheng
Ma Xinliang
Mao Weiming
Qin Juan-Juan
She Zhi-Gang
Song Xiaohui
Touyz Rhian M.
Wang Haitao
Wang Yibin
Xia Meng
Xiang Da
Xie Jing
Xu Qingbo
Yan Youqin
Ye Ping
Yuan Yufeng
Zhang Bing-Hong
Zhang Peng
Zhang Xiao-Jing
Zhang Xingyuan
Zhao Ling-Ping
Zhou Feng
Zhu Lihua
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/10/2020
Field of study

The prognostic power of circulating cardiac biomarkers, their utility and pattern of release in coronavirus disease 2019 (COVID-19) patients have not been clearly defined. In this multi-centered retrospective study, we enrolled 3,219 patients with diagnosed COVID-19 admitted to 9 hospitals from December 31, 2019 to March 4, 2020, to estimate the associations and prognostic power of circulating cardiac injury markers with the poor outcomes of COVID-19. In the mixed-effect Cox model, after adjusting for age, gender and comorbidities, the adjusted hazard ratios of 28-day mortality for high-sensitivity cardiac troponin I (hs-cTnI) was 7.12 (95%CI, 4.60-11.03; P<0.001), NT-proB-type natriuretic peptide (NT-proBNP) was 5.11 (95%CI, 3.50-7.47; P<0.001), CK-MB was 4.86 (95%CI, 3.33-7.09; P<0.001), myoglobin was 4.50 (95%CI, 3.18-6.36; P < 0.001), and CK was 3.56 (95%CI, 2.53-5.02; P < 0.001). The cutoffs of those cardiac biomarkers for effective prognosis of 28-day mortality of COVID-19 were found to be much lower than for regular heart disease at about 49% of the currently recommended thresholds. Patients with elevated cardiac injury markers above the newly established cutoffs were associated with significantly increased risk of COVID-19 death. In conclusion, cardiac biomarker elevations are significantly associated with 28-day death in patients with COVID-19. The prognostic cutoffs for of these values might be much lower than the current reference standards. These findings can assist better management of COVID-19 patients to improve outcomes. Importantly, the newly established cutoff levels of COVID-19 associated cardiac biomarkers may serve as useful criteria for the future prospective studies and clinical trials

Crossref

Enlighten